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A  LIKELIHOOD  RATIO  TEST  REGARDING 
TWO  NESTED  BUT  OBLIQUE  ORDER 
RESTRICTED  HYPOTHESES 


\ 


Giles  Warrack  and  Tim  Robertson 


SUMMARY 

In  an  article  in  the  Journal  of  Psychiatric  Research,  [Cadoret, 
Woolson  and  Winokur,  1977]  consider  two  theories  regarding  the  genetic 
makeup  of  patients  suffering  from  unipolar  affective  disorder.  These 
two  theories  imply  nested  but  oblique  order  restrictions  on  the 
parameters  of  a  statistical  model.  A  likelihood  ratio  test  for  these 
two  restrictions  is  studied. 


1 .  Introduction. 


In  this  paper  we  consider  a  hypothesis  test  in  which  both  the  null  and 
the  alternative  hypotheses  impose  an  order  restriction  upon  a  set  of  para¬ 
meters.  Specifically,  we  deal  with  a  situation  in  which  we  take  independent 
samples  from  k  populations,  where  each  population  is  distributed 
according  to  a  particular  member  of  an  exponential  family  (we  also  consider 
sampling  from  a  multinomial  population).  If  we  denote  by  0^  the  para¬ 
meter  of  interest  from  the  i1"*1  population,  we  consider  hypotheses  Hq 
and  H^,  where 


(1.1)  H0:  -  e2  -  ...  +  ek[  >  8ti+1 


ek  1  •  •  •  1  ek  +1 
K2  K«,-l 


(1.2)  Ha:  >  02  ...  >  ek  >  ek  +1  1  • • *  >  • • •  1  • • •  1  ^ • 


We  note  that  H.  contains  Hft,  and  we  wish  test  H„  against  H  -H_.  (H. 

A  0  0  A  0  A 

but  not  Hq) . 

This  investigation  was  stimulated  partly  by  a  problem  encountered 
in  psychiatric  research,  [winokur  et  al.,  1971 ]  studied  data  on  psychiatric 
illnesses  afflicting  family  members  of  patients  suffering  from  unipolar 
affective  disorder  (u.a.d.).  This  data  strongly  suggested  a  higher  rate  of  certain 
kinds  of  illness  in  family  members  of  patients  who  were  affected  by 
u.a.d.  early  in  life  than  the  rate  for  family  members  of  patients  for  which 
the  onset  of  the  disease  occurred  later  in  life.  This  research  suggested 
genetic  differences  between  these  patients  which  could  be  tested  using 
data  on  psychiatric  Illnesses  suffered  by  family  members.  In  Cadoret  et  al. 
two  different  explanations  (theories)  are  examined  which  might  explain  this 
phenomenon.  The  first  postulates  a  qualitative  difference  whereby 
there  exist  twe  qualitatively  different  genes  or  groups  of  genes  which 


-2- 


explain  the  differences  between  the  early  onset  probands  and  the  late  onset 
probands.  The  second  postulates  differences  caused  by  a  single  group 
of  genes  with  differences  between  the  groups  due  to  the  number  of  genes 
(the  quantitative  theory) . 

Cadoret  et  al.  (1977)  examined  data  on  6  groups  of  patients  suffer¬ 
ing  from  u.a.d.  The  groupings  are  based  upon  the  age  at  which  the  patient 
is  first  affected  by  the  condition.  Conclusions  regarding  the  genetic 
makeup  of  these  patients  were  based  upon,  inferences  about  the  parameters 

p^;  i  =  1,2,..., 6  where  in  one  case  p^  represents  the  proportion  of 

tVi 

alcoholic  fathers  of  patients  in  the  i  age  group  and  in  another 
case  it  represents  the  proportion  of  depressive  parents  of  patients  in 
the  iC^  group.  In  terms  of  these  parameters  the  researchers  quantify 
the  two  theories  as  follows: 


ha:  Pi  i  P2  i  P3  i  P4  i  P5  1  P6 

where  Hq  represents  the  qualitative  theory  and  HA  represents  the 
quantitative  theory.  The  problem  is  to  use  the  data  to  decide  between 
and  Ha-Hq;  thus  this  problem  falls  within  the  general  framework  we  are 
considering. 

The  hypothesis  test  represented  by  (1.1)  and  (1.2)  is  a  special  case 
of  a  large  class  of  problems  in  statistics  where  both  the  null  hypothesis 
and  the  alternative  impose  an  order  restriction  on  the  parameter  set. 
Additional  research  on  order  restricted  hypothesis  tests  where  the  null 
hypothesis  is  composite  can  be  found  in  [Robertson  and  Wright,  198 1 3 
and  [Dykstra  and  Robertson,  1982,1983],  An  interesting  feature  of 
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the  test  regarding  (1.1)  and  (1.2)  is  the  null  hypothesis  behavior  of 
the  distribution  of  the  likelihood  ratio  statistic  which  is  not  what  one 
might  expect  in  light  of  previous  research  in  the  area  of  testing 
statistical  hypotheses  under  order  restrictions.  In  virtually  all  of 
these  previously  investigated  problems  either  the  null  hypothesis  is 
a  similar  region  for  the  likelihood  ratio  test  (cf.  Barlow  et  al.  (1972)) 
or  homogeneity  is  the  least  favorable  configuration  within  the  null 
hypothesis  for  the  distribution  of  the  likelihood  ratio  test  (cf. 
[Robertson  and  Wegman,  1978,  Robertson,  1975  and  Robertson,  1978]).  By 
homogeneity  being  "least  favorable"  we  mean  that  for  all  possible  values 
of  the  parameter  set,  within  the  null  hypothesis  region,  the  likelihood 
ratio  statistic  achieves  its  supremum  when  all  the  6  are  equal. 

We,  however,  obtain  a  rather  surprisingly  different  result  which  is 
described  below. 

Suppose  that  the  0^  are  the  means  of  normal  populations  with 
known  variances,  and  suppose  that  based  upon  independent  samples,  we 
wish  to  test  HQ  against  Ha-Hq  where,  say 


V  61  ®2  "  etj  -  °kj+l  °k 


v  e!  ie2  i  •••  i\+l  i  •••  i  V 


Then  we  are  faced  with  the  problem  that  the  likelihood-ratio  based 
statistic,  L,  is  not  similar  over  the  null  hypothesis  region.  This  time 
the  8  £  Hn  giving  sup  pQfL  >  tD  is  that  vector  such  that  the  difference 

0  ee«o 

between  the  first  components  and  the  second  k-k^  components  approaches 

infinity.  Furthermore,  this  result  enables  us  to  show  that 


(1.3)  sup  P0[L  >  tU  =  P[L'  >  O 


-***>..<*  ■ 
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where  L'  is  the  likelihood  ratio  based  statistic  for  testing  Hq' 
against  -  Hq'  where 


(1.4)  HJ:  6,  -  ...  -  8^  ;  6^^  -  ...  -  ek 


(1.5)  H^:  e,  i  ...  i  ;  eki+1>...>ek 


The  statistic  L'  is  shown  to  be  distributed  as  the  sum  of  two 
independent  random  variables  each  of  which  is  distributed  according  to 
Bartholomew's  chi-bar-squared  distribution.  A  similar  result  is  obtained 
in  dealing  with  hypotheses  of  the  form  (1.1)  and  (1.2),  in  which  we 
test  for  a  descending  trend  between  X  sets  of  parameters,  with  equality 
within  the  sets,  or  "blocks",  against  the  hypothesis  of  a  descending 
trend  both  within  and  between  blocks. 

In  Section  2  we  consider  tests  for  the  normal  distribution,  and  in 
Section  3  we  use  these  results  to  derive  asymptotic  tests  for  the 
exponential  class  of  distributions  and  the  multinomial  distribution. 

In  Section  4  we  illustrate  the  use  of  the  techniques  developed  here  by 
analyzing  one  of  the  data  sets  in  [Cadoret  et  al.,  1977]. 


2.  Tests  for  the  normal  distribution. 

We  begin  by  considering  the  case  in  which  we  sample 
independent  normal  populations  with  known  variances.  We 
Hq  against  Ha~Hq  where 


(2.1)  Hq:  p j  =  u2 


\  1  \  +i  "  •  •  ■ 

i  K1 


V. 


(2. 2)  Ha:  m1>p2  ...  i  i  "k1+l  - 


from  k 
wish  to  test 
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If  we  denote  the  likelihood  ratio  by  A,  then  we  would  reject  Hq  in 

favor  of  H.  for  large  values  of  the  statistic  L  ■  -2  log  A.  We  first 
A 

introduce  some  notation. 

We  take  a  geometric  approach,  and  use  the  symbols  and 

H  to  represent  not  only  statistical  hypotheses  but  also  subspaces  of 
A 

R 

k  -  dimensional  Euclidean  space,  E  .  The  subspaces  denoted  by  HQ  and 

are  closed  convex  cones.  If  x  =  (xj ,X2» • • • .x^)  is  the  vector 

of  observed  sample  means  obtained  from  taking  samples  of  size  n^:  i*l,2,...k 

2 

from  k  normal  populations  with  known  variances  ,  then  the  maximum 

likelihood  estimate  of  y_  =  . .  .wk>  under  HA(HQ)  is  the  projection 

of  x  onto  the  closed  convex  cone  H. (Hq) ,  where 

Hq  -  (y  €  Ek;  y,  -  y2  -  ...  -  '  •••  ’  V 

HA  =  (y  €  Ek:  yt  >  y2  >  ...  1  yk>, 

2 

_  °i 

the  projection  being  taken  with  respect  to  weights  w  -  - 

i 

(see  Barlow  et  al.,  Chapter  2).  This  projection  we  denote  by 

E  (xlH  )  (E  (x|Hj).  The  ith  component  of  this  vector  we  denote  by 
w  —  A  w  —  0 

The  statistic  L  =  -2  log  A  may  be  written  af 

(2.3)  L  -  I  [x  -  E  (x JH  )  ]2  w  -  l  [x  -  E(x|HA)r]2w1  . 
i-1  -  i=l 

Using  the  notation  ||  •  ||  to  denote  the  norm  in  E  associated  with 

k 

the  inner  product  (x,y)  =  £  x.y  v  ,  we  may  denote  L  by 

-  i=l  1  1 

-  ||x  -  e(£|h0)||  l  -  "x  -  e(x|ha)  II  l  . 


(2. A)  L 
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Using  well  known  properties  of  projections  on  closed  convex  cones 
(see  Barlow  et  al.,  Chapter  2)  together  with  some  algebra,  L  may  be 
expressed  equivalently  as 

(2.5)  L  =  || F  (x|Ha)||^  -  !|Ew(x|H0)||  l  . 


The  following  inequality  is  fundamental  to  the  results  that 
follow  and  we  state  it  as  a  theorem. 

—  k 

THEOREM  2.1.  If  6_  t  Hq,  then  for  any  x  c  E  we  have 
(2.6)  L(x  +  6)  A  L(x). 


PROOF.  We  break  up  the  proof  into  two  separate  cases.  In  the 
first  we  consider  the  case  in  which 


(2.7) 


"1 

i“ixi 

ki 

l 

i=l  1 


k 

l  w  x 

>  i=kl+l 


*  "i 


i=kL+l 


> 


the  weighted  average  on  the  left  hand  side  of  the  inequality  sign 
being  the  common  value  of  Ew(x|h^)^,  i  =  l,2,...,k,  and  the  weighted 
average  on  the  right  hand  side  being  the  common  value  of 
Ew(x|Hq)^,  i  =  kj+1 ,kj+2, . . . ,k.  We  can,  without  loss  of  generality  assume 
that  6  =  (6,6, . . .6, 0,0,0, . . .0) ,  where  6  >  0,  and  we  have 


l  w  (x  +  6) 
i=l  1 


(2.8) 


i=kj+l 


"A 


k 

l 

i=l 


1 

Lwi 


i=k1+l 


l  wi 
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The  quantity  on  the  left  hand  side  being  the  common  value  of 
Ew(x  +  <5 ! Hq)  1  £  i  £  k^.  It  is  straightforward  to  show  that 

(2.9)  IK*  +  §)  -  Ew(x  +  6|H0)i|  l  =  ||x  -  Ew(x|H0)||  ^  . 


We  now  utilize  Theorem  2.1  of  Robertson  and  Wegman  (1978),  which  states 
that  if  C  is  any  closed  convex  cone  in  Hilbert  Space,  and  if  _z  €  C, 
then  for  any 

(2.10)  ||(£+  z)  -  E  (^+  zic)(iw  <  ||i  -  E  (Z|C)||  . 

Since  6  £  c  H^,  we  have 

(2.11)  -||(x+6)  -  Ew(x  +  6|Ha)||  2  >-||x  -  Ew(x|HA)||  2  . 


Combining  (2.9)  and  (2.11)  we  obtain,  for  the  case  in  question, 
L ( x  +  6)  >_  L (x)  . 

We  now  consider  the  case  where  x  is  a  vector  such  that 


l  Vi 

1=1  1 


(2.12) 


i=k1+l 


k 

2  Vi 


k 

l 

i=l 


1 

l  Wi 


I  wi 

•  i  A 


i=k^+l 


In  this  case  the 
k 


value 


l  Vi 

i=l  1 
k 

I 

i=l 


Ew(x| H  )  ,  i=l,2, . . . ,kj . k,  will  have  the  common 


Let  e  be  the  positive  difference  between  the  right 
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and  left  hand  sides  of  (2.12)  and  let  £,  t_  €  Hg  be  the  vector  with  e 
as  its  first  k^  components  and  0's  elsewhere.  We  shall  show  first  that 
if  =  (61,61,...,61,0,...,0)  and  0  <_  e,  then 

(2.13)  L(x+6j)  iL(x). 

We  shall  then  in  fact  be  done,  since  x  +  £  is  a  vector  such 
(2.7)  holds,  and  we  will  then  have  for  any  _6  6  that 

(2.14)  L(x  +  6_  +  e)  ^  L(x  +  _£)>_  L(x)  . 

Thus  all  possible  cases  will  be  covered. 

To  prove  (2.13)  we  need  two  lemmas.  The  first  is  obvious,  and 
we  state  it  without  proof. 


LEMMA 

2.2. 

Let 

{x^x^  ...  ,xk)  and  {y1,y2> . . .  ,yk>  be 

two  sets 

of  numbers 

such 

that 

ltliX2i-'V  5rl2yz'-2yk 

and 

yi  "  yi+l  -  xi  “  xi+l  -  0  • 


Then  for  any  set  of  positive  weights  ^wi  ,w2  ’ '  ’ ' ’w]<7 


(2.16) 


k  2  k 

I  w  (y  -  y)  >  l  w  (x  -  x)‘ 
i=l  i=l  1  1 


where  x  and  y  are  averages  taken  with  respect  to  the  weights. 


LEMMA  2.3.  If  x  is  any  vector  in  E  .  and  if  6  €  H. ,  then 

A 


(2.17)  Ewh  +  «|HA)1  -  Eu(k  +  £|Ha)1+1  >  Evfe|HA)t  - 


i+1 


i  =  1,2,... ,k-l . 
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PROOF.  The  proof  follows  from  noting  that  the  sets  on  which 
E  (x  +  6 1 Ha )  assumes  constant  values  are  subsets  of  the  sets  on  which 
Ew(— lHA>  assumes  constant  values,  and  then  using  the  maximum  upper 
sets  algorithm.  (See  Section  2.3  of  [Barlow  et  al.,  1972]). 

We  now  return  to  inequality  (2.13),  and  note  that  in  the  particular 

case  we  are  dealing  with,  using  the  form  (2.5),  L(x  +  6^)  and  L(jc) 

may  be  written  respectively  as  the  weighted  sum  of  squares  of  the 

E  (x  +  6 , I H . ) ,  and  the  E  (x|h.),  around  their  weighted  means, 
w  —  —  1 1  A  i  w  — 1  A  i  ° 

(We  also  use  the  fact,  to  be  found  in  Barlow  et  al.,  that  for  any 

k  k  k 

xfc  E  ,  and  any  closed  convex  cone,  H,  £  w  x  =  1  w-iE  (x|H).). 

i=l  i=l  1  -  1 

Thus  we  may  invoke  Lemmas  2 . 2  and  2.3  to  obtain 
L(x  +  6^)  L(x)  , 
and  the  proof  is  completed. 

The  inequality  obtained  in  Theorem  2.1  enables  us  to  obtain  the 
following  result  regarding  the  distribution  of  the  likelihood  ratio  based 
statistic,  L(X) ,  over  the  null  hypothesis  region. 

THEOREM  2.4.  Define  the  hypotheses  H'  and  H'  as  follows 

U  A 

(2.18)  H^:  Pj  =  u2  =  ...  =  pkj  ;  y^+1  =  ...  =  uk 

(2.19)  Ha:  tijiw2>  ...  1  vki  i  ^+1  1  - 

Let  L'(X)  =  -2  log  A'(X),  A'(X)  being  the  likelihood  ratio  for 

testing  Hq  against  -  Hq.  Then 

sup  p  Cl  >  t]  =  p[l'  >  t]  . 

P€H0  P 
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PROOF.  It  is  clear  from  the  inequality  obtained  in  Theorem  2.1 
that  if  £  =  (y,y, . . . ,y,0,0, . . .0) ,  £  -  and  y  >  0,  then  L(x  +  £> 

is  an  increasing  function  of  y  ,  for  x  fixed.  Since 

(2.20)  CL(X)  >  t]  =  Pq  [L(X  +  £)  >  t]  >  PQ  [L(X)  >  t], 

we  note  that  for  all  t,  P^  Cl(X)  >  t3  is  also  an  increasing  function 
of  y.  Thus  we  may  state  that 

(2.21)  sup  P  Cl(X)  >  t]  =  lira  P  [L(X)  >  tj  . 

y<=H0  P  y-»  p 

It  is  easy  to  show  that  the  expression  on  the  right  is  equal  to 
P[L' (X)  >  t].  To  see  this  we  observe  that  for  any  given  x, 

L(x)  and  L'(x)  will  coincide  if  the  minimum  of  x^,  i  £  is 
greater  than  the  maximum  of  the  x^,  i  >  k^-  As  y  approaches 

infinity,  the  probability  of  this  event  approaches  1.  Hence  we  may 

state 

(2.22)  sup  P  [L(X)  >  t]  =  lim  P  [  L(X)  >  t]  =  pL.'(X)  >  t]  . 

y£HQ  U  y-*°°  W 

The  distribution  of  L'(X)  is  easy  to  derive,  and  is  independent 
of  £  t  Hq.  To  see  this  we  let  hj(X)  and  L^CX)  be  the  likelihood 
ratio  based  statistics  for  testing 

Hoi:  pi  =  "•  = 

against 


AT 


*1  -  m2  - 


-  V 
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and 

H02:  ^+1  =  “  \ 

against 


V+l  -  * 


‘  -Pk 


(Lj(X)  =  -2  log  Aj(X),  and  I^OO  =  -2  log  AjOO,  the  A^  being  the 

respective  likelihood  ratios).  Clearly  L'(X)  “  L^(X)  +  L^CX).  and 

I>2  are  independent,  since  is  a  function  of  the  first  k^  variables 

only,  and  a  function  of  the  second  k-k^.  Furthermore,  both 

and  L2  are  distributed  according  to  Bartholomew's  chi-bar-squared 
distribution  (see  Barlow  et  at.,  Chapter  2). 

Arguing  as  in  Section  5  of  Robertson  and  Wegman  (1978)  we  find  that 
the  null  hypothesis  distribution  of  L'  is  a  chi-bar-square  distribution 
where  the  coefficients  on  the  various  chi-square  tail  probabilities 
can  be  found  by  convoluting  two  sequences  of  level  probabilities 
corresponding  to  linear  order  restrictions.  Specifically 


(2.23)  p[l'  =  0]  =  P^l.kj)  P2(l,k-k1) 
and 

k  ? 

(2.24)  P[L '  >  t]  =  l  Q  P[xT  -  >  t]  ;  t  >  0 

i=3 


where  P^ij.k^)  is  the  probability  that  E  (x|.H^j)  assumes 

distinct  values  amongst  its  first  k,  components  and  P2(X2>k-k^) 

is  the  probability  that  E  (XjH^)  assumes  X 2  distinct  values  amongst 
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its  second  k-k^  components  and  the  sequence  £ 0^3 i-2  is  the  convolution 

of  the  two  sequences  [P^Xj.kj)  ;  X -  l,2,...,kj]  And 

{?2^2,lt-*CP  ’  ^2  =  •  •  »k-kjj  •  The  probabilities  Pj(X^,kj) 

and  ?2(X2,k-k^)  are  computed  undi  the  assumption  that  Hqj  and 

Hq2  are  true.  Algorithms  for  their  computation  are  discussed  at  length 

in  [Barlow  et  al.,  1972]  and  an  approximation  is  given  in  [Robertson 

and  Wright,  1983]. 

Thus  far  we  have  dealt  only  with  the  case  in  which  the  null 

hypothesis  postulates  a  descending  trend  between  two  blocks  of  parameters. 

We  may  in  fact  extend  our  result  to  cover  an  arbitrary  number  of  blocks 

of  parameters.  For  example  consider  the  problem  of  testing  Hq  against 

H.-H  where 
A  0 


(2.25)  Hq:  -  V2  -  ...  »*ki  *  ~  uk2  2  -  Ukje_1+1 

(2.26)  Ha:  Pj  i  ...  iy  1  ...  1  ...  1V 

Here  the  null  hypothesis  postulates  equality  within  X  sets  of  parameters 
and  a  descending  trend  between  blocks,  while  postulates  a  descending 

trend  both  within  and  between  blocks. 

Theorems  2.1  and  2.4  have  natural  extensions  for  this  situation, 
and  in  that  regard  we  define  the  hypotheses  Hq  and  H^  as  follows: 


* 


(2.27)  H^:  Pj  =  P2 


kx+l 


(2.28)  HJ^:  Pj  1  u2  > 
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1 

J 

) 


Analogous  to  the  inequality  for  two  blocks  proved  in  Theorem  2.1,  we  may 
use  induction  to  obtain  for  each  x> 


(2.29)  L(x)  s  L ' (x) 


where  L  and  L'  are  the  likelihood  ratio  based  statisticis  for  testing 
Hq  against  ha~Hq  and  hq  against  H^-Hq,  respectively. 

Similarly,  using  the  inequality  we  may  show  that 


(2.30)  sup  P  fL(X)  >  t]  =  P  {l'(X)  >  t]. 
^H0  ~ 


The  distribution  of  L'(X)  may  be  easily  shown  to  be  distributed  as  the 
sum  of  Z  independent  random  variables  each  distributed  according  to  a 
version  of  Barthomew's  chi-bar-square  distribution.  Thus,  we  may  state 


(2.31)  P[L*(X)  >  t]  =  l  Q  P[X<  ,  >  t]  For  t  >  0 

j=X+l  2  2~ 


z 

(2.32)  P[l’(X)  =  0]  =  n  P.(l,k  -k  ) 

i=l  1  1-1 


where  the  sequence  { Q . 3  is  found  by  convoluting  Z  sequences  of 

3  j-* 

level  probabilities  as  in  (2.23)  and  (2.24). 


3.  Asymptotic  tests  for  the  exponential  family  and  the  multinomial  distribution. 

Because  the  analysis  in  this  section  so  closely  parallels  the  work 
in  [Robertson  and  Wegman,  1978]  and  in  [Robertson,  1978]  we  present 
only  sketches  of  the  arguments.  We  first  consider  an  exponential  family 
£f(*  ;©,t)3  which,  following  Lehmann,  1959  ,  we  parameterize  as  follows. 


* 


J 
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(3.1)  f (x;  0,x)  =  expCp1(e)p2(x)k(x)  +  s(x,t)  +  q(6;t)D;  6  6  »®2) 5  x  6  T 

The  following  assumptions  are  made: 

(3.2)  p^(*)  and  q ( * ; x)  both  have  continuous  second  derivatives  on  some 
interval  (S^.e^)  for  all  x  €  T. 

(3.3)  pj(6)  >  0  for  all  0  £  (0^,02)  and  for  all  x  €  T. 

(3.4)  q*  (8;x)  =  -0  p|(0)p2(t)  for  all  0  €  (0^,02)  and  for  all  t  €  T. 

We  are  thinking  of  x  as  fixed  so  that  all  derivatives  are  with  respect 
to  0  .  Using  this  parameterization  we  have 

(3.5)  E[k(xS  =  0 

(3.6)  var  [k(X)3  =  [pj (0) p2 (t) U-1. 

We  consider  the  problem  of  testing  HQ  against  HA-HQ  where 
we  sample  from  each  of  k  independent  population  and 


V  01  e2  -  •••  -  eki  1  eki+1  -  •••  =  0k 


ha:  0,  >  e9  >  >0,  >  0.  , ,  >  ...  >  ...  >  0, 

A  1  -  2  -  -  kx  -  kx+l  -  k 


(In  this  section  we  consider  only  two  blocks.  Extensions  of  the  results 
presented  here  to  an  arbitrary  number  of  blocks  are  straightforward.  In 
the  analysis  of  the  unipolar  affective  disorder  data  in  Section  4  the 
distribution  is  binomial.) 
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Suppose  we  take  a  sample  of  size  ni  from  the  ith  population: 
1  *  1,2,... ,k,  and  letting  A  be  the  likelihood  ratio,  we  have 


(3.7)  L  -  -2  log  A 
k 


'  2  J1(ni8iP2(Ti)i:pl(ei)"  Pl(V0lVi>^ 

+  ni:q(0i;Ti)-  q(E^(e|H0)1;  Tl)]) 
k 

+  q(Ew(e|HA)i; 


where 


n. 


«.8>  ;t-j-  iuV 


i 

’  1 
‘i  j*l 


and  the  weights,  w^,  are  given  by 

k 


(3.9) 


—  1 

wi  =  tn1P2(T1)]  •  [  l  njP2(Tj)D  . 


Using  a  second  degree  Taylor  series  expansion,  and  equition  (3.4)  we 
may  write  L  in  the  form 

(3.10)  L  -  -  {  ^  n1C0iP2(Ti)p1”(a1)  +  •  [0^(0  Ih^]2} 


-  {-Z  n1C01P2(T1)p1"(c1)  +  q"(di;T1):.[0i-E^(0|HA)1]  } 


I 


Where  a±  and  lie  between  0i  and  E^olH^,  and  c±  and  dt 

lie  between  0.  and  E  (Q I H. ) . . 

i  w  1  A  i 

Using  the  results  of  the  previous  section  we  obtain  the  following 
result  regarding  the  asymptotic  distribution  of  L. 


Theorem  3.1 


(3.11)  sup  lim  P[L  >  t]  =  P[T’  >  t[] 


n^x® 


where 


T'  ■  l  -  E^xlH')^2 


-  I  VjPjejP^T^CXj  -  e^xIh;)^2 

and  where  X  =  (Xj,Xj, . . . ,X^)  is  a  k  dimensional  vector  of  normally 
distributed  independent  random  variables  with 


(3.12)  E(X1)  =  0 


(3.13)  var  (X±)  =  ^^(e^p^T^T 
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and 

K 

are  the 

closed 

convex 

cones 

associated  with  the  hypotheses 

»o: 

61  = 

02  ■  . . . 

=  6k  ; 
K1 

6k1+l 

=  ... 

"  9k 

61  i 

0  2  ^  . .  • 

-  6k  5 
K1 

9k1+l 

1  •  •  • 

The  constants  y  are  defined  in  the  following  way. 
are  dealing  with  sets  of  sample  sizes,  (n^}»  i*l»... 


ni 

for  each  set,  —  is  some  constant,  c... 

"j  13 


We  define 


We  assume  we 
,k,  such  that 


min  {n  } 
l<i<k  1 


PROOF.  We  first  consider  some  arbitrary  0^  €  Hq,  and  we 

define  hypotheses  H'^  and  by  dividing  the  hypotheses 

and  H.  in  separate  parts  according  to  the  different  values  assumed 
0  A 

by  0_.  This  is  most  easily  explained  by  an  example:  suppose  we  are 
testing 


V 


91  =  e,  >  S  =  e4 


against 


H 


and  suppose 


01  -  62  -  03  -  \ 
that  8  is  such  that 


0.  >  0. 


-  *»»•' 
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then  we  define  Hq  and  as  follows 

H0:  6!  *  02  ;  03  *  6 4 

»A:  6i  i  62  5  03  i  6 4 
If  e2  =  e3  then  Kq  =  H0  and  = 

Now,  using  techniques  similar  to  those  in  [Robertson  and  Wegman, 
1978]  it  follows  that 

L  j,  hPt(9l)F2(,l)[  ii  -  Ev(il"S>i]2 
'  jj  Vl(VP2<’2>[  -i  -  Eval"A>i]2 

where  X  and  w  are  as  described  in  the  theorem.  We  denote  this 
limiting  random  variable  by  T".  Using  the  results  obtained  in  this 
paper  for  normal  random  variables  we  conclude  that 

lim  PjL'  >  t]  =  P.[T"  >  t]  £  P[Tr  >  t]. 
y  y 


To  show  that  the  supremura  is  attained,  note  that  if  0^  >  0^  then 

HU  coincides  with  H',  H"  coincides  with  Hf ,  and  T"  =  T'. 

0  0  A  A 

In  Section  4  we  use  Theorem  3.1  to  treat  the  unipolar  affective 
disorder  data  mentioned  in  the  introduction. 
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We  now  treat  the  case  in  which  we  are  sampling  from  a  multinomial 
distribution,  i.e.  we  have  a  random  vector  X  where 


fx  (xr 


,xk) 


V 


.  •  •  x  ! 
k' 


x, 
Pi  1 


x„ 
P2  2 


xi 

,p.  k 


k 

£  xi 


i-1 


k 

Ip, 

i-1  1 


If  we  wish  to  maximize  the  likelihood  subject  to  the  restriction 

that  the  p^  satisfy  the  partial  order,  the  maximizing  estimates  may  be 

obtained  by  taking  the  unrestricted  maximum  likelihood  estimates, 

~  x^ 

p.  =  —  ,  and  projecting  them  onto  the  closed  convex  cone  associated  with 
l  n 

the  partial  order.  The  projection  is  taken  using  unit  weights  (see 
Example  2.1  in  [Barlow  et  al.,  1972]). 

We  treat  the  problem  of  testing  against  H^-H^  where 


V  P1  =  P2 


-  V1  =  =  Pk 


V  P1  —  P2  — 


1  Pl 


The  treatment  here  closely  parallels  the  work  in  Robertson  (1978). 

If  A  is  the  likelihood  ratio  ,  and  we  let  L  =  -2  log  A,  then  we 

have 

a  Ic  a  a 

(3.14)  L  =  2  I  np  [In  E  (£ i H  )  1  -  np  [In  E  (jp|H~)  3 

i=l  1  1  i=l  U  1 


where  E(p1h^)  and  E(p|Hp)  denote  the  projections,  using  equal 
weights,  of  £  onto  and  respectively.  Using  second  degree 

Taylor  series  expansion  we  may  rewrite  this  as 
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(3.15)  L  =  l  np  ~2  Cp.  -  E  (pjH  )  ]2  -  £  np  Cp  -  E  (p(H  )  Ij2 

1=1  1  1  01  i=l  1  1 

where  lies  between  p^  and  E  (p|Hq)^,  and  a^  lies  between  p^ 

and  E  (p |H^) . .  Writing  L  in  this  form  enables  us  to  prove  the  following 
theorem,  analagous  to  Theorem  3.1  for  the  exponential  family: 

THEOREM  3.2 

(3.20)  sup  lim  P  CL  >  t]  =  PfT  >  t3 
p€  Hq  n-> °°  P. 

where 

(3.21)  T  -  Pl[2.  -  E^ZlH')^2  -  P1[21  -  EpCZlH;)^2  , 

-  (Z^,...Z^)  being  a  vector  of  k  independent  normally  distributed 
random  variables  with 

(3.22)  E(Zi)  =  0 

(3.23)  var  (Zf)  =  p"1  . 

The  hypotheses  Hq  and  are  defined  by  the  closed  convex  cones 

associated  with 


X 
o  - 

pi  ■  ^  ■  •••  ■  "kj1  Vi  ’ 

...  -  Pk 

u  *  • 

V 

>  Pfc 

-■  v 
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PROOF.  The  proof  is  similar  to  the  proof  of  Theorem  3.1  and  closely 
parallels  the  proof  of  Theorem  2  in  [Robertson,  1978]. 


4.  An  example  involving  the  binomial  distribution. 

We  now  consider  the  problem  discussed  in  Section  1  concerning  the 
two  theories  which  explain  the  age  of  onset  of  unipolar  affective 
disorder.  The  hypotheses  in  question  may  be  stated  as 


V  P1  =  P2  =  P3  -  P4  =  P5  =  P6 


Ha:  ?!  —  ?2  —  ?3  —  p4  —  p5  —  p6 


where  Hn  represents  the  "qualitative"  theory  of  genetic  transmission 
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and  the  "quantitative"  theory.  From  a  mathematical  point  of  view, 

the  most  interesting  data  set  in  Cadoret  et  al.  regards  the 
proportion  of  depressive  parents  in  males  suffering  from  u.a.d..  For 
these  data  the  six  sample  sizes  (i.e.  numbers  of  male  probands  in  each  group) 
are  66,  174,  134,  166,  116,  and  140  and  the  six  relative  frequencies 
are  .212,  .103,  .149,  .145,  .086,  .100.  Any  one  of  several  algorithms 
in  Barlow  et  al.,  1972  can  be  used  to  compute  the  maximum  likelihood 
estimates  restricted  by  and  H^.  Using  the  pool  adjacent  violators 

algorithm,  we  found  that  the  maximum  likelihood  estimates  subject  to 


are  .139, 

.139, 

.139, 

.114, 

.114, 

.114  and  the  estimates  satisfying 

are  .212, 

.131, 

.131, 

.131, 

.131, 

.094,  .094.  The  resulting 

likelihood  ratio  has  a  value  of  5.362. 

In  order  to  compute  a  P-valve  we  must  compute 

3^<  l  +a  <  6  px( V3)P2<V3)  P  tx£1+Jl2-2  -  5*362-1' 

The  values  of  the  level  probabilities,  Pj(i^3)  and  ^2^2’^  depend 
upon  the  sample  sizes  (weights)  and  can  be  found  using  the  formulas 
on  page  140  of  Barlow  et.  al  (1972).  The  resulting  P-value  is  .0648. 

Additional  details  regarding  the  analysis  of  this  data  set  and 
analysis  of  the  other  data  from  Cadoret  et.  al  may  be  found  in  Robertson 
and  War rack  (198 1 ) . 

5.  Comments. 

As  noted  in  the  introduction,  the  null  hypothesis  behavior  of  the 
distribution  of  the  likelihood  ratio  test  statistic  is  not  what  one 
might  expect  in  light  of  previous  research  in  the  area  of  testing 
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statistical  hypotheses  under  order  restriction.  Moreover,  the  test 
statistic  L' ,  which  is  a  likelihood  ratio  statistic  for  testing 

H0!  P1  =  P2  =  P3*  P4  =  P5  =  P6  against  Ha“  H0  where 

H^:  p^  >_  P2  ?3»  P4  i  P5  Pg>  ^as  H^(and  thus  Hq)  as  a  similar 

region  and  has  exactly  the  same  null  hypothesis  distribution  as  L.  This, 

together  with  the  fact  that  L  <_L'  means  that  L'  is  uniformly 

more  powerful  than  L.  This  result  seems  somewhat  paradoxical  in 

that  L'  does  not  account  for  the  prior  knowledge  that  p^  ^  p^.  If 

we  use  L'  to  compute  a  P-valve  for  our  data  set  then  the  observed 

valve  is  5.783  yielding  a  P-valve  of  .0533. 

In  computing  P-values,  power  considerations  may  not  be  too 
pertinent  since  P-values  are  computed  under  the  null  hypothesis  and 
power  says  something  about  the  quality  of  the  test  when  the  alternative 
is  true.  It  seems  to  us  that  the  P-value  computed  using  L  is  rather 
more  representative  of  the  evidence  in  the  data  against  than  the 

P-value  computed  using  L'. 
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